The library scikit-fda is a Python package for Functional Data Analysis (FDA). It provides a comprehensive set of tools for representation, preprocessing, and exploratory analysis of functional data. The library is built upon and integrated in Python's scientific ecosystem. In particular, it conforms to the scikit-learn application programming interface so as to take advantage of the functionality for machine learning provided by this package: pipelines, model selection, and hyperparameter tuning, among others. The scikit-fda package has been released as free and open-source software under a 3-Clause BSD license and is open to contributions from the FDA community. The library's extensive documentation includes step-by-step tutorials and detailed examples of use.
translated by 谷歌翻译
可以部署一组合作的空中机器人,以有效地巡逻地形,每个机器人都会在指定区域飞行,并定期与邻居共享信息,以保护或监督它。为了确保鲁棒性,以前对这些同步系统的作品提出了将机器人发送到相邻区域的情况,以防它检测到故障。为了处理不可预测性并提高确定性巡逻计划的效率,本文提出了随机策略,以涵盖在代理之间分配的领域。首先,在本文中针对两个指标进行了对随机过程的理论研究:\ emph {闲置时间},这是两个连续观察到地形的任何点和\ emph {隔离时间}之间的预期时间,预期的时间},预期的时间机器人没有与任何其他机器人通信的时间。之后,将随机策略与添加另一个指标的确定性策略进行了比较:\ emph {广播时间},从机器人发出消息的那一刻,直到团队的所有其他机器人收到消息。模拟表明,理论结果与模拟和随机策略的表现非常吻合,其行为与文献中提出的确定性协议获得的行为相比。
translated by 谷歌翻译
大型深度神经网络的联合培训通常可以受到限制,因为将更新与增加模型大小进行交流的成本增加。在集中设置中设计了各种模型修剪技术,以减少推理时间。将集中的修剪技术与联合培训相结合似乎是降低沟通成本的直观 - 通过在沟通步骤之前修剪模型参数。此外,在培训期间,这种渐进的模型修剪方法也可以减少培训时间/成本。为此,我们提出了FedSparsify,该公司在联合培训期间执行模型修剪。在我们在集中式和联合的设置中对大脑年龄预测任务的实验(估计一个人的年龄从大脑MRI估算),我们证明,即使在具有高度异构数据的高度异质数据的挑战性的联盟学习环境中,也可以将模型最多可修剪高达95%的稀疏性,而不会影响表现。分布。模型修剪的一个令人惊讶的好处是改进的模型隐私。我们证明,具有高稀疏性的模型不太容易受到会员推理攻击的影响,这是一种隐私攻击。
translated by 谷歌翻译
智能测试的分辨率,特别是数值序列,对AI系统的评估引起了极大的兴趣。我们提出了一种称为Kitbit的新计算模型,该模型使用简化的算法及其组合来构建一个预测模型,该模型在数值序列中找到了基础模式,例如IQ测试中包含的模型以及更复杂的其他模型。我们介绍了该模型的基础及其在不同情况下的应用。首先,对从各种来源收集的智商测试中使用的一组数字系列进行了测试。接下来,我们的模型已成功应用于用于评估文献报道的模型的序列。在这两种情况下,系统都可以使用标准计算能力在不到一秒钟的时间内解决这些类型的问题。最后,Kitbit的算法首次应用于著名的OEI数据库的整个序列的完整集。我们以算法列表的形式找到了一个模式,并在迄今为止最大的系列数量中预测了以下术语。这些结果证明了kitbit解决可以用数值表示的复杂问题的潜力。
translated by 谷歌翻译
许多科学预测问题在使用稀疏和不均匀分布的观测中处理空间和时间的复杂变化方面具有时空数据和建模相关的挑战。本文提出了一种新颖的深度学习架构,对位置依赖的时间序列数据(DEEPLatte)的深度学习预测,明确地将空间统计的理论纳入神经网络以解决这些挑战。除了特征选择模块和时空学习模块之外,Deeplatte还包含一个自相关引导的半监督学习策略,以强制执行学习的时空嵌入空间中的预测的本地自相关模式和全局自相关趋势,以与观察到的数据一致,克服了稀疏和不均匀分布式观测的限制。在培训过程中,监督和半监督亏损指导整个网络的更新:1)防止过度装备,2)优化特征选择,3)学习有用的时空表示,4)改善整体预测。我们在一位良好的公共卫生主题,空气质量预测中,使用公共公共卫生主题,在学习,复杂的身体环境中进行了展示Deeblatte的演示 - 洛杉矶。该实验表明,该方法提供准确的细空间尺度空气质量预测,并揭示了影响结果的关键环境因素。
translated by 谷歌翻译
基因表达数据集通常具有高维度,因此需要有效且有效的方法来识别其属性的相对重要性。由于可能的解决方案的搜索空间的大小,属性子集评估特征选择方法往往不适用,因此在这些方案中使用特征对方法。文献中描述的大多数特征排名方法是单变量的方法,因此它们不会检测因子之间的相互作用。在本文中,我们提出了基于成对相关性和成对一致性的两种新的多变量特征排名方法,我们应用于三种基因表达分类问题。我们在统计上证明所提出的方法优于现有技术的状态,特征对方法进行分类方法聚类变化,CHI平方,相关性,信息增益,相关性和意义,以及基于与多目标的相关性和一致性的属性子集评估的特征选择方法进化搜索策略。
translated by 谷歌翻译
检测局部特征,例如角落,段或斑点,是许多计算机视觉应用中的第一步。它的速度对于实时应用至关重要。在本文中,我们在文献中呈现elsed,最快的线段探测器。其效率的关键是局部段生长算法,其在存在小不连续性的情况下连接梯度对齐的像素。所提出的算法不仅在具有非常低端硬件的设备中运行,而且还可以参数化以促进短期或更长的段的检测,具体取决于手头的任务。我们还介绍了新的指标,以评估段探测器的准确性和重复性。在我们的实验中,我们证明我们的方法账户最高的重复性,它在文献中最有效。在实验中,我们量化了此类收益所交易的准确性。
translated by 谷歌翻译
Ithaca is a Fuzzy Logic (FL) plugin for developing artificial intelligence systems within the Unity game engine. Its goal is to provide an intuitive and natural way to build advanced artificial intelligence systems, making the implementation of such a system faster and more affordable. The software is made up by a C\# framework and an Application Programming Interface (API) for writing inference systems, as well as a set of tools for graphic development and debugging. Additionally, a Fuzzy Control Language (FCL) parser is provided in order to import systems previously defined using this standard.
translated by 谷歌翻译
In this work a novel recommender system (RS) for Tourism is presented. The RS is context aware as is now the rule in the state-of-the-art for recommender systems and works on top of a tourism ontology which is used to group the different items being offered. The presented RS mixes different types of recommenders creating an ensemble which changes on the basis of the RS's maturity. Starting from simple content-based recommendations and iteratively adding popularity, demographic and collaborative filtering methods as rating density and user cardinality increases. The result is a RS that mutates during its lifetime and uses a tourism ontology and natural language processing (NLP) to correctly bin the items to specific item categories and meta categories in the ontology. This item classification facilitates the association between user preferences and items, as well as allowing to better classify and group the items being offered, which in turn is particularly useful for context-aware filtering.
translated by 谷歌翻译
Detecting anomalous data within time series is a very relevant task in pattern recognition and machine learning, with many possible applications that range from disease prevention in medicine, e.g., detecting early alterations of the health status before it can clearly be defined as "illness" up to monitoring industrial plants. Regarding this latter application, detecting anomalies in an industrial plant's status firstly prevents serious damages that would require a long interruption of the production process. Secondly, it permits optimal scheduling of maintenance interventions by limiting them to urgent situations. At the same time, they typically follow a fixed prudential schedule according to which components are substituted well before the end of their expected lifetime. This paper describes a case study regarding the monitoring of the status of Laser-guided Vehicles (LGVs) batteries, on which we worked as our contribution to project SUPER (Supercomputing Unified Platform, Emilia Romagna) aimed at establishing and demonstrating a regional High-Performance Computing platform that is going to represent the main Italian supercomputing environment for both computing power and data volume.
translated by 谷歌翻译